Consistent delay of reward vs consistent nonreward in the alley

Similar resources

The washback effect of discrete-point vs. integrative tests on the retention of content in knowledge tests

This thesis evaluates the effect of two types of tests, discrete-point and integrative, on the retention of content. The results indicate that integrative tests are more effective than the other tests.

Consistent absence of BRAF mutations in salivary gland carcinomas

Introduction: Malignant salivary gland tumors are rare entities. Despite advances in surgery, radiation therapy, and chemotherapy, mortality and five-year survival rates have not improved markedly over the last few decades. Activation of the EGFR-RAS-RAF signaling pathway contributes to the initiation and progression of many human cancers, promising a key pathway for therapeutic m...

Consistent and Coherent Learning with δ-delay

A consistent learner is required to correctly and completely reflect in its actual hypothesis all data received so far. Though this demand sounds quite plausible, it may lead to the unsolvability of the learning problem. Therefore, in the present paper several variations of consistent learning are introduced and studied. These variations allow a so-called δ-delay relaxing the consistency deman...

A Self-Consistent Technique for the Construction and Evaluation of the Three-Parameter Corresponding States Principles

A self-consistent approach for the evaluation of the existing three-parameter corresponding states principles of non-polar fluids and the calculation of the corresponding states parameters is presented. This self-consistent approach is based upon the assumption that the contribution of the third parameter to the thermophysical properties is much smaller than the contributions of the first two p...

Inverse Reinforcement Learning with Locally Consistent Reward Functions

Existing inverse reinforcement learning (IRL) algorithms have assumed each expert’s demonstrated trajectory to be produced by only a single reward function. This paper presents a novel generalization of the IRL problem that allows each trajectory to be generated by multiple locally consistent reward functions, hence catering to more realistic and complex experts’ behaviors. Solving our generali...

Journal

Journal title: Psychonomic Science

Year: 1967

ISSN: 0033-3131

DOI: 10.3758/bf03327856